SUFFICIENT CONDITIONS FOR A REAL POLYNOMIAL 
TO BE A SUM OF SQUARES 



JEAN B. LASSERRE 

Abstract. We provide explicit sufficient conditions for a polynomial
$f$ to be a sum of squares (s.o.s.), linear in the coefficients of $f$. All
conditions are simple and provide an explicit description of a convex
polyhedral subcone of the cone of s.o.s. polynomials of degree at most
$2d$. We also provide a simple condition to ensure that $f$ is s.o.s., possibly
after adding a constant.



1. Introduction 

The cone $\Sigma^2 \subset \mathbb{R}[X]$ of real polynomials that are sums of squares (s.o.s.)
and its subcone $\Sigma^2_d$ of s.o.s. of degree at most $2d$ play a fundamental role in
many areas, and particularly in optimization; see for instance Lasserre [4],
Parrilo [8] and Schweighofer [9]. When considered as a convex cone of a
finite-dimensional Euclidean space, $\Sigma^2_d$ has a lifted semidefinite representation (such
sets are called SDr sets in [2]). That is, $\Sigma^2_d$ is the projection of a convex cone
of a Euclidean space of higher dimension, defined in terms of the coefficients
of the polynomial and additional variables (the "lifting"). However, so far
there is no simple description of $\Sigma^2_d$ given directly in terms of the coefficients
of the polynomial. For more details on SDr sets, the interested reader is
referred to e.g. Ben-Tal and Nemirovski [2], Helton and Vinnikov [3], Lewis
et al. [7].

Of course, one could use Tarski's quantifier elimination to provide a
description of $\Sigma^2_d$ solely in terms of the coefficients, but such a description is
unlikely to be simple; in particular, it could be sensitive to the degree
$d$. Therefore, a more reasonable goal is to search for simple descriptions of
subsets (or subcones) of $\Sigma^2_d$ only. This is the purpose of this note, in which
we provide simple sufficient conditions for a polynomial $f \in \mathbb{R}[X]$ of degree
at most $2d$ to be s.o.s. All conditions are expressed directly in terms of the
coefficients $(f_\alpha)$, with no additional variable (i.e. with no lifting), and define
a convex polyhedral subcone of $\Sigma^2_d$. Finally, we also provide a sufficient
condition on the coefficients of highest degree to ensure that $f$ is s.o.s., possibly
after adding a constant. All conditions stress the importance of the
essential monomials $(X_i^{2k})$, which also play an important role for approximating
nonnegative polynomials by s.o.s., as demonstrated in e.g. [5, 6].

2. Conditions for being s.o.s.

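For $\alpha \in \mathbb{N}^n$ let $|\alpha| := \sum_{i=1}^n |\alpha_i|$. Let $\mathbb{R}[X]$ be the ring of real polynomials
in the variables $X = (X_1, \ldots, X_n)$, and let $\mathbb{R}[X]_{2d}$ be the vector space of real
polynomials of degree at most $2d$, with canonical basis of monomials $(X^\alpha) =
\{X^\alpha : \alpha \in \mathbb{N}^n;\ |\alpha| \le 2d\}$. Given a sequence $y = (y_\alpha) \subset \mathbb{R}$ indexed in the
canonical basis $(X^\alpha)$, let $L_y : \mathbb{R}[X]_{2d} \to \mathbb{R}$ be the linear mapping

        $f \ \Big(= \sum_\alpha f_\alpha X^\alpha\Big) \ \mapsto \ L_y(f) := \sum_\alpha f_\alpha y_\alpha$,

and let $M_d(y)$ be the moment matrix with rows and columns indexed in
$(X^\alpha)$, and defined by

(2.1)   $M_d(y)(\alpha, \beta) := L_y(X^{\alpha+\beta}) = y_{\alpha+\beta}$,   $\alpha, \beta \in \mathbb{N}^n : |\alpha|, |\beta| \le d$.

Let the notation $M_d(y) \succeq 0$ stand for "$M_d(y)$ is positive semidefinite". It is
clear that

        $M_d(y) \succeq 0 \iff L_y(f^2) \ge 0 \quad \forall f \in \mathbb{R}[X]_d$.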
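As a small illustration of (2.1) in the univariate case $n = 1$, the following sketch builds $M_d(y)$ from the moments of a finitely supported measure and checks that $L_y(f^2)$ coincides with the quadratic form $c^T M_d(y)\, c$, where $c$ is the coefficient vector of $f$. The function names and the sample measure are assumptions made for this example only; they do not come from the paper.

```python
from fractions import Fraction

def moments(atoms, weights, order):
    """y_k = sum_j w_j * x_j**k for k = 0..order (moments of a discrete measure)."""
    return [sum(w * x**k for x, w in zip(atoms, weights)) for k in range(order + 1)]

def moment_matrix(y, d):
    """Univariate moment matrix: M_d(y)[a][b] = y_{a+b}, 0 <= a, b <= d."""
    return [[y[a + b] for b in range(d + 1)] for a in range(d + 1)]

def riesz(y, coeffs):
    """L_y(f) = sum_k f_k * y_k for f = sum_k f_k X^k."""
    return sum(c * y[k] for k, c in enumerate(coeffs))

def square(coeffs):
    """Coefficient list of f^2."""
    out = [Fraction(0)] * (2 * len(coeffs) - 1)
    for i, a in enumerate(coeffs):
        for j, b in enumerate(coeffs):
            out[i + j] += a * b
    return out

d = 2
atoms = [Fraction(-1), Fraction(1, 2), Fraction(3)]          # hypothetical support
weights = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4)]   # hypothetical weights
y = moments(atoms, weights, 2 * d)
M = moment_matrix(y, d)

f = [Fraction(1), Fraction(-2), Fraction(1)]                 # f = 1 - 2X + X^2
quad_form = sum(f[a] * M[a][b] * f[b] for a in range(d + 1) for b in range(d + 1))
print(quad_form, riesz(y, square(f)))
```

Because $y$ here comes from a nonnegative measure, $M_d(y) \succeq 0$ holds automatically, so the quadratic form is nonnegative for every choice of $f$.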

The set $\Sigma^2_d \subset \mathbb{R}[X]_{2d}$ of s.o.s. polynomials of degree at most $2d$ is a
finite-dimensional convex cone, and

(2.2)   $f \in \Sigma^2_d \iff L_y(f) \ge 0 \quad \forall y$ s.t. $M_d(y) \succeq 0$.

Remark 1. To prove that $L_y(f) \ge 0$ for all $y$ such that $M_d(y) \succeq 0$ it suffices
to prove that $L_y(f) \ge 0$ for all $y$ such that $M_d(y) \succeq 0$ and $L_y(1) > 0$ (and
equivalently, by homogeneity, for all $y$ such that $M_d(y) \succeq 0$ and $L_y(1) = 1$).

Indeed, suppose that $L_y(f) \ge 0$ for all $y$ such that $M_d(y) \succeq 0$ and $L_y(1) > 0$.
Next, let $y$ be such that $M_d(y) \succeq 0$ and $L_y(1) = 0$. Fix $\epsilon > 0$ arbitrary
and let $y(\epsilon) := y + (\epsilon, 0, \ldots, 0)$, so that $L_{y(\epsilon)}(X^\alpha) = y_\alpha$ if $\alpha \ne 0$ and $L_{y(\epsilon)}(1) =
\epsilon > 0$. Therefore $M_d(y(\epsilon)) \succeq 0$ (because $M_d(y) \succeq 0$) and so $0 \le L_{y(\epsilon)}(f) =
\epsilon f_0 + L_y(f)$. As $\epsilon > 0$ was arbitrary, letting $\epsilon \downarrow 0$ yields the desired result
$L_y(f) \ge 0$.

We first recall a preliminary result whose proof can be found in Lasserre 
and Netzer [6]. 

Lemma 1 ([6]). With $d \ge 1$, let $y = (y_\alpha) \subset \mathbb{R}$ be such that the
moment matrix $M_d(y)$ defined in (2.1) is positive semidefinite, and let
$\tau_d := \max_{i=1,\ldots,n} L_y(X_i^{2d})$. Then:

(2.3)   $|L_y(X^\alpha)| \le \max[L_y(1), \tau_d]$,   $\forall \alpha \in \mathbb{N}^n : |\alpha| \le 2d$.

We next complement Lemma 1.

Lemma 2. Let $y = (y_\alpha) \subset \mathbb{R}$ be normalized with $y_0 = L_y(1) = 1$, and such
that $M_d(y) \succeq 0$. Let $\tau_d := \max_{i=1,\ldots,n} L_y(X_i^{2d})$. Then:

(2.4)   $|L_y(X^\alpha)|^{1/|\alpha|} \le \tau_d^{1/2d}$,   $\forall \alpha \in \mathbb{N}^n : 1 \le |\alpha| \le 2d$.

For a proof see §3.1.
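The bounds (2.3) and (2.4) can be checked numerically on the moments of a finitely supported probability measure, for which $M_d(y) \succeq 0$ and $L_y(1) = 1$ hold automatically. The sketch below is only a sanity check under that assumption; the helper names and the sample measure are hypothetical. To stay in exact rational arithmetic, the inequality $|y_\alpha|^{1/|\alpha|} \le \tau_d^{1/2d}$ is tested in the equivalent form $|y_\alpha|^{2d} \le \tau_d^{|\alpha|}$.

```python
from fractions import Fraction
from itertools import product

def moment(points, weights, alpha):
    """y_alpha = sum_j w_j * prod_i x_{j,i}**alpha_i (moment of a discrete measure)."""
    total = Fraction(0)
    for point, w in zip(points, weights):
        term = Fraction(w)
        for x, a in zip(point, alpha):
            term *= Fraction(x) ** a
        total += term
    return total

def check_moment_bounds(points, weights, d):
    """Return (tau_d, bound_23_holds, bound_24_holds) for the measure's moments."""
    n = len(points[0])
    # tau_d is the largest moment of an essential monomial X_i^{2d}
    tau = max(
        moment(points, weights, tuple(2 * d if j == i else 0 for j in range(n)))
        for i in range(n)
    )
    y0 = moment(points, weights, (0,) * n)
    ok_23 = ok_24 = True
    for alpha in product(range(2 * d + 1), repeat=n):
        deg = sum(alpha)
        if deg > 2 * d:
            continue
        y = abs(moment(points, weights, alpha))
        ok_23 = ok_23 and y <= max(y0, tau)
        if deg >= 1:
            # |y|^(1/deg) <= tau^(1/2d) is equivalent to |y|^(2d) <= tau^deg
            ok_24 = ok_24 and y ** (2 * d) <= tau ** deg
    return tau, ok_23, ok_24

# Hypothetical probability measure on three points of the plane
points = [(Fraction(1, 2), -1), (2, 1), (-1, 3)]
weights = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4)]
tau, ok_23, ok_24 = check_moment_bounds(points, weights, d=2)
print(tau, ok_23, ok_24)
```

A passing check on one measure is of course no substitute for the proof; it only illustrates how the two bounds are read.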
2.1. Conditions for a polynomial to be s.o.s. With $d \in \mathbb{N}$, let $\Gamma \subset \mathbb{N}^n$
be the set defined by:

(2.5)   $\Gamma := \{\alpha \in \mathbb{N}^n : |\alpha| \le 2d;\ \alpha = 2\beta$ for some $\beta \in \mathbb{N}^n\}$.

We now provide our first condition.

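Theorem 3. Let $f \in \mathbb{R}[X]_{2d}$ and write $f$ in the form

(2.6)   $f = f_0 + h + \sum_{i=1}^n f_{i2d} X_i^{2d}$,

where $h \in \mathbb{R}[X]_{2d}$ contains no essential monomial $X_i^{2d}$. If

(2.7)   $f_0 \ \ge \ \sum_{\alpha \notin \Gamma} |f_\alpha| - \sum_{\alpha \in \Gamma} \min[0, f_\alpha]$,

(2.8)   $\min_{i=1,\ldots,n} f_{i2d} \ \ge \ \sum_{\alpha \notin \Gamma} |f_\alpha| \frac{|\alpha|}{2d} - \sum_{\alpha \in \Gamma} \min[0, f_\alpha] \frac{|\alpha|}{2d}$,

then $f \in \Sigma^2_d$.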

For a proof see §3.2. The sufficient conditions (2.7)-(2.8) define a polyhedral
convex cone in the Euclidean space of coefficients $(f_\alpha)$ of polynomials
$f \in \mathbb{R}[X]_{2d}$. This is because the functions

        $f \mapsto \min_{i=1,\ldots,n} f_{i2d}$,   $f \mapsto \min[0, f_\alpha]$,   $f \mapsto -|f_\alpha|$,

are all piecewise linear and concave. The description (2.7)-(2.8) of this
convex polyhedral cone is explicit and given only in terms of the coefficients
$(f_\alpha)$, i.e., with no lifting.

Notice that (2.7)-(2.8) together with $f_{i2d} = 0$ for some $i$ implies $f_\alpha = 0$
for all $\alpha \notin \Gamma$, and $f_\alpha \ge 0$ for all $\alpha \in \Gamma$, in which case $f$ is obviously s.o.s.

Theorem 3 is interesting when $f$ has few nonzero coefficients. When $f$
has many nonzero coefficients and contains the essential monomials $X_i^{2k}$
for all $k = 1, \ldots, d$, all with positive coefficients, one provides the following
alternative sufficient condition. With $k \le d$, let

(2.9)    $\Gamma_k^1 := \{\alpha \in \mathbb{N}^n : 2k - 1 \le |\alpha| \le 2k\}$,

(2.10)   $\Gamma_k^2 := \{\alpha \in \Gamma_k^1 : \alpha = 2\beta$ for some $\beta \in \mathbb{N}^n\}$.

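Corollary 4. Let $f \in \mathbb{R}[X]_{2d}$ and write $f$ in the form

(2.11)   $f = f_0 + h + \sum_{k=1}^d \sum_{i=1}^n f_{i2k} X_i^{2k}$,

where $h \in \mathbb{R}[X]_{2d}$ contains no essential monomial $X_i^{2k}$. If

(2.12)   $\frac{f_0}{d} \ \ge \ \sum_{\alpha \in \Gamma_k^1 \setminus \Gamma_k^2} |f_\alpha| - \sum_{\alpha \in \Gamma_k^2} \min[0, f_\alpha]$,

(2.13)   $\min_{i=1,\ldots,n} f_{i2k} \ \ge \ \sum_{\alpha \in \Gamma_k^1 \setminus \Gamma_k^2} |f_\alpha| \frac{|\alpha|}{2k} - \sum_{\alpha \in \Gamma_k^2} \min[0, f_\alpha] \frac{|\alpha|}{2k}$,

for all $k = 1, \ldots, d$, then $f \in \Sigma^2_d$.

For a proof see §3.3. Notice that (2.12)-(2.13) together with $f_{i2k} = 0$ for
some $i$ and some $k \in \{1, \ldots, d\}$ implies $f_\alpha = 0$ for all $\alpha \in \Gamma_k^1 \setminus \Gamma_k^2$ and $f_\alpha \ge 0$
for all $\alpha \in \Gamma_k^2$.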

Several variants of Corollary 4 can be derived; for instance, any other
way to distribute the constant term $f_0$ as $\sum_{k=1}^d f_{0k}$ with $f_{0k} \ne f_0/d$ is valid
and also provides another set of sufficient conditions. Consider also the case
when $f$ can be written as

        $f = f_0 + h + \sum_{k \in K} \sum_{i=1}^n f_{i2k} X_i^{2k}$,

where $K := \{k \in \{1, \ldots, d\} : \min_{i=1,\ldots,n} f_{i2k} > 0\}$, $d \in K$, and $h \in \mathbb{R}[X]_{2d}$
contains no essential monomial $X_i^{2k}$, $k \in K$. Then one may easily derive a
set of sufficient conditions in the spirit of Corollary 4.



Finally, one provides a simple condition for a polynomial to be s.o.s., 
possibly after adding a constant. 

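Corollary 5. Let $f \in \mathbb{R}[X]_{2d}$ and write $f$ in the form

(2.14)   $f = f_0 + h + \sum_{i=1}^n f_{i2d} X_i^{2d}$,

where $h \in \mathbb{R}[X]_{2d}$ contains no essential monomial $X_i^{2d}$. If

(2.15)   $\min_{i=1,\ldots,n} f_{i2d} \ > \ \sum_{\alpha \notin \Gamma;\ |\alpha| = 2d} |f_\alpha| - \sum_{\alpha \in \Gamma;\ |\alpha| = 2d} \min[0, f_\alpha]$

with $\Gamma$ as in (2.5), then $f + M \in \Sigma^2_d$ for some $M \ge 0$.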

Proof. Let $-M := \min[0, \inf_y \{L_y(f) : M_d(y) \succeq 0;\ L_y(1) = 1\}]$. We prove
that $M < +\infty$. Assume that $M = +\infty$, and let $y^j$ be a minimizing sequence.
One must have $\tau_{jd} := \max_{i=1,\ldots,n} L_{y^j}(X_i^{2d}) \to \infty$ as $j \to \infty$; otherwise, if $\tau_{jd}$
is bounded by, say, $\rho$, by Lemma 1 one would have $|L_{y^j}(X^\alpha)| \le \max[1, \rho]$
for all $|\alpha| \le 2d$, and so $L_{y^j}(f)$ would be bounded, in contradiction with
$L_{y^j}(f) \to -\infty$. But then from Lemma 2, for sufficiently large $j$, one obtains
the contradiction

        $0 > \frac{L_{y^j}(f)}{\tau_{jd}} \ \ge \ \min_{i=1,\ldots,n} f_{i2d} - \sum_{\alpha \notin \Gamma;\ |\alpha| = 2d} |f_\alpha| + \sum_{\alpha \in \Gamma;\ |\alpha| = 2d} \min[0, f_\alpha]$
        $\qquad\qquad - \sum_{0 \le |\alpha| < 2d} |f_\alpha| \, \tau_{jd}^{(|\alpha| - 2d)/2d} \ > \ 0$,

where the last inequality follows from (2.15) and $\tau_{jd}^{(|\alpha| - 2d)/2d} \to 0$ as $j \to \infty$.

Hence, $M < +\infty$ and so $L_y(f + M) \ge 0$ for every $y$ with $M_d(y) \succeq 0$ and
$L_y(1) = 1$. But then, in view of Remark 1, $L_y(f + M) \ge 0$ for all $y$ such
that $M_d(y) \succeq 0$, which in turn implies that $f + M$ is s.o.s. □
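To see how (2.15) is read on a concrete instance, consider the following hypothetical example (it is not taken from the paper), with $n = 2$ and $d = 2$. Only the coefficients of degree exactly $2d$ enter the test.

```latex
% Hypothetical instance, n = 2, d = 2:
\[
  f \;=\; X_1^4 + X_2^4 + \tfrac12\, X_1^3 X_2 \;-\; 10\, X_1 \;-\; 5 .
\]
% Degree-4 monomials of h: only X_1^3 X_2, with exponent (3,1) not in \Gamma.
\[
  \min_{i=1,2} f_{i4} \;=\; 1
  \;>\;
  \sum_{\alpha \notin \Gamma;\ |\alpha| = 4} |f_\alpha|
  \;-\; \sum_{\alpha \in \Gamma;\ |\alpha| = 4} \min[0, f_\alpha]
  \;=\; \tfrac12 - 0 \;=\; \tfrac12 .
\]
% Hence (2.15) holds and f + M is s.o.s. for some M >= 0,
% although f itself is not s.o.s. (f(0,0) = -5 < 0).
```

The lower-degree terms $-10 X_1 - 5$ play no role in (2.15); they only influence how large the constant $M$ has to be.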

In Theorem 3, Corollary 4 and Corollary 5 it is worth noticing the crucial role
played by the constant term and the essential monomials $(X_i^{2k})$, as was
already the case in [5, 6] for approximating nonnegative polynomials by s.o.s.

3. Proofs 

The proof of Lemma 2 first requires the following auxiliary result.

Lemma 6. Let $d \ge 1$, and let $y = (y_\alpha) \subset \mathbb{R}$ be such that the moment matrix
$M_d(y)$ defined in (2.1) is positive semidefinite, and let $\tau_d := \max_{i=1,\ldots,n} L_y(X_i^{2d})$.

Then: $L_y(X^{2\alpha}) \le \tau_d$ for all $\alpha \in \mathbb{N}^n$ with $|\alpha| = d$.

Proof. The proof is by induction on the number $n$ of variables. The case
$n = 1$ is trivial and the case $n = 2$ is proved in Lasserre and Netzer [6,
Lemma 4.2].

Let the claim be true for $k = 1, \ldots, n - 1$ and consider the case $n > 2$. By
the induction hypothesis, the claim is true for all $L_y(X^{2\alpha})$, where $|\alpha| = d$
and $\alpha_i = 0$ for some $i$. Indeed, $L_y$ restricts to a linear form on the ring
of polynomials with $n - 1$ indeterminates and satisfies all the assumptions
needed. So the induction hypothesis gives the boundedness of all those
values $L_y(X^{2\alpha})$.

Now take $L_y(X^{2\alpha})$, where $|\alpha| = d$ and all $\alpha_i \ge 1$. With no loss of
generality, assume $\alpha_1 \le \alpha_2 \le \ldots \le \alpha_n$. Consider the two elements

        $\gamma := (2\alpha_1, 0, \alpha_3 + \alpha_2 - \alpha_1, \alpha_4, \ldots, \alpha_n) \in \mathbb{N}^n$ and
        $\gamma' := (0, 2\alpha_2, \alpha_3 + \alpha_1 - \alpha_2, \alpha_4, \ldots, \alpha_n) \in \mathbb{N}^n$.

We have $|\gamma| = |\gamma'| = d$ and $\gamma_2 = \gamma'_1 = 0$, and from what precedes,

        $L_y(X^{2\gamma}) \le \tau_d$ and $L_y(X^{2\gamma'}) \le \tau_d$.

As $M_d(y) \succeq 0$, one also has

        $L_y(X^{2\alpha})^2 = L_y(X^{\gamma + \gamma'})^2 \le L_y(X^{2\gamma}) \cdot L_y(X^{2\gamma'}) \le \tau_d^2$,

which yields the desired result $|L_y(X^{2\alpha})| \le \tau_d$. □

3.1. Proof of Lemma 2. The proof is by induction on $d$. Assume it is true
for $k = 1, \ldots, d$, and write $M_{d+1}(y)$ in the following block form with
appropriate matrices $V$, $U_i$, $V_i$, $S_i$:


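$$
M_{d+1}(y) \;=\;
\begin{bmatrix}
M_{d-2}(y) & U_1 & U_2 & V \\
U_1^T & S_{2d-2} & V_{2d-1} & V_{2d} \\
U_2^T & V_{2d-1}^T & S_{2d} & V_{2d+1} \\
V^T & V_{2d}^T & V_{2d+1}^T & S_{2d+2}
\end{bmatrix}
$$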
When $d = 1$, the blocks $M_{d-2}(y)$ and $U_1$, $U_2$, $U_1^T$, $U_2^T$, $V$ disappear.

• The case $|\alpha| = 2d + 2$ is covered by Lemma 6.

• Consider an arbitrary $y_\alpha$ with $|\alpha| = 2d$. From the definition of the
moment matrix, one may choose a pair $(i, j)$ such that the position $(i, j)$ in
the matrix $M_{d+1}(y)$ lies in the submatrix $V_{2d}$, and the corresponding entry
is $y_\alpha$. From $M_{d+1}(y) \succeq 0$,

        $M_{d+1}(y)(i,i) \, M_{d+1}(y)(j,j) \ge y_\alpha^2$.

As $M_{d+1}(y)(i,i)$ is an element $y_\beta$ of $S_{2d-2}$ with $|\beta| = 2d - 2$, invoking the
induction hypothesis yields $M_{d+1}(y)(i,i) \le \tau_d^{(2d-2)/2d}$. On the other hand,
$M_{d+1}(y)(j,j)$ is a diagonal element $y_{2\beta}$ of $S_{2d+2}$ with $|\beta| = d + 1$. From
Lemma 6, every diagonal element of $S_{2d+2}$ is dominated by $\tau_{d+1}$, and so
$M_{d+1}(y)(j,j) \le \tau_{d+1}$. Combining the two yields

        $y_\alpha^2 \le \tau_d^{(d-1)/d} \, \tau_{d+1}$,   $\forall \alpha : |\alpha| = 2d$.

Next, picking the element $\alpha$ such that $y_\alpha = \tau_d$, one obtains

(3.1)   $\tau_d^2 \le \tau_d^{1 - 1/d} \, \tau_{d+1} \ \Longrightarrow \ \tau_d^{1/d} \le \tau_{d+1}^{1/(d+1)}$,

and so

        $y_\alpha^2 \le \tau_d^{(d-1)/d} \, \tau_{d+1} \le \tau_{d+1}^{2d/(d+1)}$, i.e. $|y_\alpha|^{1/2d} \le \tau_{d+1}^{1/(2d+2)}$,   $\forall \alpha : |\alpha| = 2d$.

• Next, consider an arbitrary $y_\alpha$ with $|\alpha| = 2d + 1$. Again, one may
choose a pair $(i, j)$ such that the position $(i, j)$ in the matrix $M_{d+1}(y)$
lies in the submatrix $V_{2d+1}$, and the corresponding entry is $y_\alpha$. The
entry $M_{d+1}(y)(i,i)$ corresponds to an element $y_{2\beta}$ of $S_{2d}$ with $|\beta| = d$, and
so, by Lemma 6, $M_{d+1}(y)(i,i) \le \tau_d$; similarly the entry $M_{d+1}(y)(j,j)$
corresponds to an element $y_{2\beta}$ of $S_{2d+2}$ with $|\beta| = d + 1$, and so, by Lemma 6
again, $M_{d+1}(y)(j,j) \le \tau_{d+1}$. From $M_{d+1}(y) \succeq 0$, we obtain

        $\tau_{d+1} \, \tau_d \ge M_{d+1}(y)(i,i) \, M_{d+1}(y)(j,j) \ge y_\alpha^2$,

which, using (3.1), yields $|y_\alpha|^{1/|\alpha|} = |y_\alpha|^{1/(2d+1)} \le \tau_{d+1}^{1/(2d+2)}$ for all $\alpha$ with
$|\alpha| = 2d + 1$.

• Finally, for an arbitrary $y_\alpha$ with $1 \le |\alpha| < 2d$, use the induction
hypothesis $|y_\alpha|^{1/|\alpha|} \le \tau_d^{1/2d}$ and (3.1) to obtain $|y_\alpha|^{1/|\alpha|} \le \tau_{d+1}^{1/(2(d+1))}$. This
argument is also valid for the case $|\alpha| = 2d$, but this latter case was treated
separately to obtain (3.1).

It remains to prove that the induction hypothesis is true for $d = 1$. This
easily follows from the definition of the moment matrix $M_1(y)$. Indeed, with
$|\alpha| = 1$ one has $y_\alpha^2 \le y_{2\alpha} \le \tau_1$ (as $L_y(1) = 1$), so that $|y_\alpha| \le \tau_1^{1/2}$ for all $\alpha$
with $|\alpha| = 1$. With $|\alpha| = 2$, say with $\alpha_i + \alpha_j = 2$, one has

        $\tau_1^2 \ge L_y(X_i^2) \, L_y(X_j^2) \ge L_y(X_i X_j)^2 = y_\alpha^2$,

and so $|y_\alpha| \le \tau_1$ for all $\alpha$ with $|\alpha| = 2$. □

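3.2. Proof of Theorem 3. From (2.2), it suffices to show that $L_y(f) \ge 0$
for any $y$ such that $M_d(y) \succeq 0$, and by Remark 1, we may and will assume
that $L_y(1) = 1$.

So let $y$ be such that $M_d(y) \succeq 0$ with $L_y(1) = 1$. Let $\tau_d$ be as in Lemma 1
and consider the two cases $\tau_d \le 1$ and $\tau_d > 1$.

• The case $\tau_d \le 1$. By Lemma 1, $|L_y(X^\alpha)| \le 1$ for all $\alpha \in \mathbb{N}^n$ with
$|\alpha| \le 2d$. Therefore,

        $L_y(f) \ \ge \ f_0 - \sum_{\alpha \notin \Gamma} |f_\alpha| + \sum_{\alpha \in \Gamma} \min[0, f_\alpha] \ \ge \ 0$,

where the last inequality follows from (2.7).

• The case $\tau_d > 1$. Recall that $L_y(1) = 1$, and from Lemma 2 one has
$|L_y(X^\alpha)|^{1/|\alpha|} \le \tau_d^{1/2d}$ for all $\alpha \in \mathbb{N}^n$ with $1 \le |\alpha| \le 2d$. Therefore,

        $L_y(f) \ \ge \ f_0 + \big(\min_{i=1,\ldots,n} f_{i2d}\big) \tau_d - \sum_{\alpha \notin \Gamma} |f_\alpha| \, \tau_d^{|\alpha|/2d} + \sum_{\alpha \in \Gamma} \min[0, f_\alpha] \, \tau_d^{|\alpha|/2d}$.

Consider the univariate polynomial $t \mapsto p(t)$ with

        $p(t) = f_0 + \big(\min_{i=1,\ldots,n} f_{i2d}\big) t^{2d} - \sum_{\alpha \notin \Gamma} |f_\alpha| \, t^{|\alpha|} + \sum_{\alpha \in \Gamma} \min[0, f_\alpha] \, t^{|\alpha|}$,

and denote by $p^{(k)} \in \mathbb{R}[t]$ its $k$-th derivative.

By (2.8), $\min_{i=1,\ldots,n} f_{i2d} \ge 0$ and so by (2.7), $p(1) \ge 0$. By (2.8) again,
$p'(1) \ge 0$. In addition, with $1 \le k \le 2d$, (2.8) also implies

        $\min_{i=1,\ldots,n} f_{i2d} \ \ge \ \sum_{\alpha \notin \Gamma} |f_\alpha| \, \frac{|\alpha|}{2d} \frac{(|\alpha| - 1)}{(2d - 1)} \cdots \frac{(|\alpha| - (k-1))}{(2d - (k-1))}$
        $\qquad\qquad - \sum_{\alpha \in \Gamma} \min[0, f_\alpha] \, \frac{|\alpha|}{2d} \frac{(|\alpha| - 1)}{(2d - 1)} \cdots \frac{(|\alpha| - (k-1))}{(2d - (k-1))}$,

because $|\alpha| - j \le 2d - j$ for all $j = 1, \ldots, k - 1$, and so

        $\Big(\prod_{j=0}^{k-1} (2d - j)\Big) \min_{i=1,\ldots,n} f_{i2d} \ \ge \ \sum_{\alpha \notin \Gamma;\ |\alpha| \ge k} |f_\alpha| \prod_{j=0}^{k-1} (|\alpha| - j) - \sum_{\alpha \in \Gamma;\ |\alpha| \ge k} \min[0, f_\alpha] \prod_{j=0}^{k-1} (|\alpha| - j)$,

which implies $p^{(k)}(1) \ge 0$. Therefore, $p^{(k)}(1) \ge 0$ for all $k = 0, 1, \ldots, 2d$, and
so, as a special case of Budan-Fourier's theorem, $p$ has no root in $(1, +\infty)$;
see Basu et al. [1, Theor. 2.36]. Hence, $p \ge 0$ on $(1, +\infty)$ and as $\tau_d > 1$,
$L_y(f) \ge p(\tau_d^{1/2d}) \ge 0$. □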
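The last step can also be seen directly from the Taylor expansion $p(t) = \sum_k p^{(k)}(1) (t-1)^k / k!$: if every $p^{(k)}(1)$ is nonnegative, then every term is nonnegative for $t \ge 1$. The sketch below computes the derivatives at $1$ exactly for one hypothetical instance, $p(t) = 1 - \frac12 t^2 + \frac34 t^4$, which is the polynomial obtained from $f = 1 + X^4 + Y^4 - \frac12 XY - \frac14 X^2 Y^2$ with $d = 2$. The function names and the instance are assumptions made for illustration.

```python
from fractions import Fraction
from math import factorial

def derivatives_at_one(coeffs):
    """coeffs[m] is the coefficient of t^m; returns [p(1), p'(1), ..., p^(deg)(1)]."""
    out = []
    c = [Fraction(x) for x in coeffs]
    while c:
        out.append(sum(c))                              # value at t = 1
        c = [m * c[m] for m in range(1, len(c))]        # differentiate once
    return out

def evaluate(coeffs, t):
    """Direct evaluation of p(t)."""
    return sum(Fraction(c) * t**m for m, c in enumerate(coeffs))

def taylor_at_one(derivs, t):
    """Evaluate p(t) through its Taylor expansion around t = 1."""
    return sum(dk / factorial(k) * (t - 1) ** k for k, dk in enumerate(derivs))

# p(t) = 1 - t^2/2 + 3 t^4/4  (hypothetical instance with d = 2)
p = [1, 0, Fraction(-1, 2), 0, Fraction(3, 4)]
derivs = derivatives_at_one(p)
t = Fraction(3, 2)
print(derivs, evaluate(p, t), taylor_at_one(derivs, t))
```

All five derivatives at $1$ are nonnegative here, so each Taylor term is nonnegative on $[1, +\infty)$ and so is $p$.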

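3.3. Proof of Corollary 4. Let $y$ be such that $M_d(y) \succeq 0$. Again in view
of Remark 1, we may and will assume that $L_y(1) = 1$.

Then $L_y(f) \ge \sum_{k=1}^d A_k$ with

(3.2)   $A_k := \frac{f_0}{d} + \sum_{i=1}^n f_{i2k} \, L_y(X_i^{2k}) + \sum_{\alpha \in \Gamma_k^2} \min[0, f_\alpha] \, L_y(X^\alpha) - \sum_{\alpha \in \Gamma_k^1 \setminus \Gamma_k^2} |f_\alpha| \, |L_y(X^\alpha)|$,   $k = 1, \ldots, d$.

Fix $k$ arbitrary in $\{1, \ldots, d\}$ and consider the moment matrix $M_k(y) \succeq 0$,
which is a submatrix of $M_d(y)$.

• Case $\tau_k \le 1$. By Lemma 1 applied to $M_k(y)$, $|L_y(X^\alpha)| \le 1$ for all
$\alpha \in \mathbb{N}^n$ with $|\alpha| \le 2k$. Therefore, with $A_k$ as in (3.2),

        $A_k \ \ge \ \frac{f_0}{d} - \sum_{\alpha \in \Gamma_k^1 \setminus \Gamma_k^2} |f_\alpha| + \sum_{\alpha \in \Gamma_k^2} \min[0, f_\alpha] \ \ge \ 0$,

where the last inequality follows from (2.12).

• Case $\tau_k > 1$. From Lemma 2 applied to $M_k(y)$, $|L_y(X^\alpha)|^{1/|\alpha|} \le \tau_k^{1/2k}$
for all $\alpha$ with $|\alpha| \le 2k$. Therefore, $A_k \ge p_k(\tau_k^{1/2k})$, where $p_k \in \mathbb{R}[t]$, and

        $p_k(t) = \frac{f_0}{d} + t^{2k} \min_{i=1,\ldots,n} f_{i2k} + \sum_{\alpha \in \Gamma_k^2} \min[0, f_\alpha] \, t^{|\alpha|} - \sum_{\alpha \in \Gamma_k^1 \setminus \Gamma_k^2} |f_\alpha| \, t^{|\alpha|}$.

As in the proof of Theorem 3, but now using (2.12)-(2.13), one has $p_k^{(j)}(1) \ge 0$
for all $j = 0, 1, \ldots, 2k$. As a particular case of Budan-Fourier's theorem, $p_k$
has no root in $(1, +\infty)$; see Basu et al. [1, Theor. 2.36]. Therefore, $p_k \ge 0$ on
$(1, +\infty)$, which in turn implies $A_k \ge p_k(\tau_k^{1/2k}) \ge 0$ because $\tau_k > 1$. Finally,
$L_y(f) \ge \sum_{k=1}^d A_k \ge 0$, as $A_k \ge 0$ in both cases $\tau_k \le 1$ and $\tau_k > 1$. □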